17:30
2026-06-14
lesswrong.com
artificial-intelligence
Can a stronger model fake being a weaker one? Mostly not
A new study finds that stronger AI models can imitate weaker predecessors' mistakes only in narrow cases, such as GPT-5.4 mimicking GPT-4o on math questions, but generally fail to impersonate specificβ¦